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ABSTRACT 

We analyse the local variance effect in the standard method for detecting the integrated Sachs- 
Wolfe effect (ISW) via cross-correlating the cosmic microwave background (CMB) with the 
large-scale structure (LSS). Local variance is defined as the systematic noise in the ISW de- 
tection that originates in the realization of the matter distribution in the observed Universe. We 
show that the local variance contributes about 1 1 per cent to the total variance in the standard 
method, if a perfect and complete LSS survey up to z « 2 is assumed. Due to local variance, 
the estimated detection significance and cosmological parameter constraints in the standard 
method are biased. In this work, we present an optimal method of how to reduce the local 
variance effect in the ISW detection by working conditional on the LSS data. The variance 
of the optimal method, and hence the signal-to-noise ratio, depends on the actual realization 
of the matter distribution in the observed Universe. We show that for an ideal galaxy sur- 
vey, the average signal-to-noise ratio is enhanced by about 7 per cent in the optimal method, 
as compared to the standard method. In the framework of our method, it is straightforward 
to correct for the magnification bias coming from gravitational lensing effects. Furthermore 
there is no need to estimate the covariance matrix by Monte Carlo simulations as in the stan- 
dard method, which saves time and increases the accuracy. Finally, we derive the correct joint 
likelihood function for cosmological parameters given CMB and LSS data within the linear 
LSS formation regime, which includes a small coupling of the two datasets due to the ISW 
effect. 

Key words: Cosmology: CMB - Large-Scale Structure - cosmological parameter estimation 



1 INTRODUCTION 



1 2006h. iRaccan elli et all (| 2008|) , iMcE wen et alj (|20a 



The integrated Sachs-Wolfe (ISW) effect JSachs & Woif3ll967h 
is an important probe of th e existence and nature of d ark energy 
terittenden & Turolc 199ff) and the nature of gravity JLue et al.l 
[2004; Zhang 2006b). Spatial curvature also gives rise to an 
ISW effect, but is well constrained to be very close to zero by 
cosmic microwave background (CMB) ex periments such as th e 
Wilkinson Microwave Anisotropy Probe JKomatsu et al.l l2008h . 
However, the detection of the ISW signal remains challenging, 
for it is obscured by primordial fluctuations in the CMB. In 
recent years, substantial effort has been made to detect the ISW 
effect via cross-correlation of the CMB with large-scale struc- 
ture (LSS) surveys, such as optical galaxy and quasar survey s 
(Sloan Digital Sky Survey, 'Adelman-McCarthv et al.! ( l2008h . 
and Two-Micron All-Sky Survey, larrett et al. ( 2 000)), radio 
surveys (NRAO VLA Sky Survey, ICondon et all ( Il998h), and 
X-ray surveys (High Energy Astrophysics Observatory, iBoldtl 
1 119871) ). S uch cross-corr e lation st udies have, for examp l e, bee n 
done by 'Boughn et al." ('l998'), Boughn & Crittenden ('2004), 
iBoughn & Crittenden (2005) , Afshordi et al. (2004) . Rassat et ak 



IPietrobon et al.l (12 006). Fosalba etalJ(l20CG ), Fosalba & Gaztafia^i 
(l2004'),'Vielvaetal. (2006) ^iu & Zhand i2006) . iHo et al.i ( i20oi) 
and Giannantonio et al. ( 20081). just to name a few of them. 

The standard method for detecting the cross-correlation be- 
tween the LSS and the CMB involves comparing the observed 
cross-correlation function with its theoretical prediction for a given 
fiducial cosmological model. The theoretical prediction is by con- 
struction an ensemble average over all possible realizations of the 
universe given the fiducial parameters, including all possible real- 
izations of the local matter distribution. Assuming ergodicity, this 
ensemble average can also be thought of as an average over all pos- 
sible positions of the observer in the Universe ('cosmic mean'). 

However, the ISW effect is created by the decay of the gravi- 
tational potential coming from the structures on the largest scales, 
i.e. from structures that have not yet undergone significant grav- 
itational collapse and are still not decoupled from the expansion 
of the Universe. These largest scales are most affected by cosmic 
variance. Therefore, when comparing the observed (local) cross- 
correlation function to its cosmic mean value, the realization of the 
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matter distribution in our vicinity acts as a source of systematic 
noise in the estimation of tlie cross-correlation, lience leading to a 
biased detection significance, due to cosmic variance. 

In this work, we estimate the contribution of this local vari- 
ance effect to the total variance in the detected signal under the 
simplifying assumption that there is no shot noise in the galaxy 
distribution. We find that the local variance in the detected signal 
amounts to 1 1 per cent in the case of an ideal LSS survey going out 
to about redshift 2 and covering enough volum e to include the larg e 
scales relevant for the ISW. This agrees with ICabre et al.l dlOOTh . 
who compare different methods to estimate the error in standard 
cross-correlation studies. They find that their MCI error, which ig- 
nores the variance coming from the realization of the matter field 
and only considers the variance in the CMB fluctuations, systemat- 
ically underestimates the error by about 10 per cent. 

From the above-mentioned surveys, the local matter distribu- 
tion is known to a certain degree, and hence the local variance ef- 
fect can be reduced by working conditional on that information. 
We present a generic technique of how to include the knowledge 
of the matter distribution into the detection of the ISW via cross- 
correlation, hence reducing the sources of noise to the unknown 
part of the matter distribution and the primordial CMB fluctuations. 
We define the systematic noise that comes from the known part of 
the matter distribution as bias, for it can be removed by working 
conditional on the LSS data. Our method is referred to as optimal 
method, in contrast to the standard method for ISW detection men- 
tioned above. The main idea of the optimal method is to create an 
ISW template from a Wiener filter reconstruction of the LSS. We 
then use this template to detect the amplitude of the ISW template 
rather than of the theoretical cross-correlation function. This makes 
the variance in the estimated amplitude, and hence the signal-to- 
noise ratio both depend on the actual realization of the matter in 
the Universe. For an ideal LSS survey, we show that the average 
variance in the detected amplitude is reduced by 13 per cent in the 
optimal method. Rephrased in terms of the signal-to-noise ratio, 
this reduction of the noise leads on average to a higher detection 
significance by about 7 per cent. As we show in this work, in the 
framework of the optimal method it is straightforward to correct for 
the magnification bia s due to gravitational lensing, as described by 
iLoverde et al.l yOOTJ). Furthermore, there is no need to estimate the 
covariance matrix by Monte Carlo simulations as in the standard 
method. This saves time and increases the accuracy of the method. 

To our knowledge, the only suggested methods for ISW detec- 
tion besides t his wor k that al so do not suffer from the local varianc e 
effect are by IZhand ( l2006ah and iHernandez-Monteagudd JlOOSl) . 
The former involves combining lensing-LSS cross-correlation 
measurements with the ISW-LSS cross-correlation, and t hereby re- 
lies on the nowadays s till-difficult lensing measurements (Hu|200l|; 
IHu & Okamotdl2002l). The latter follows an appro ach very similar 
to ours. However, iHemandez-Monteagudd OOOSf) does not use a 
Wiener filter reconstruction of the LSS distribution but works di- 
rectly on the sphere. In contrast to our work he neglects the shot 
noise, which could be easily included in his analysis, though. In 
this w ork we will go one step further than lHernandez-Monteagudd 
( 120081) and derive the correct way of including the information en- 
coded in the ISW for cosmological parameter estimation. 

Many of the above-mentioned cross-correlation studies have 
attempted to constrain cosmological parameters using a likelihood 
function for the cosmological parameters p given the observed 
cross-correlation function. Just like the detection significance, these 
parameter estimates are biased by local variance. Furthermore, to 
our knowledge, there is no straightforward way of combining the 



likelihood function for the cross-correlation with the likelihoods 
for CMB and LSS data. In this work, we derive the correct joint 
likelihood function P{T, 5g \ p) for cosmological parameters, given 
the CMB map T and the LSS data 5g, from first principles for the 
linear LSS formation regime. This joint likelihood consistently in- 
cludes the coupling between the two datasets introduced by the 
ISW effect, which so far has been neglected in analyses deriving 
cosm ological parameter constraints by combi ning CMB and LSS 
data JTegmark et al.ll2004lSpergel et alj2007h . 

The article is organized as follows. We start by briefly describ- 
ing the integrated Sachs-Wolfe effect in section |2] and explain in 
detail the different stochastic processes that are relevant for our 
analysis and the correction for the magnification bias in section 
[3] In section |4] we review the standard method for detecting the 
ISW via cross-correlation and estimate the contribution of the lo- 
cal variance to the total variance of the detected signal. Section |5] 
is devoted to presenting the optimal method of ISW detection we 
developed, and to comparing it in detail to the standard method. We 
discuss the role of the biasing effect due to local variance in param- 
eter constraints and derive the joint likelihood function P{T, 5g \ p) 
in section |6] Concluding remarks on our work are given in section 

13 



2 THE INTEGRATED SACHS-WOLFE EFFECT 

The effect of decaying gravitational potential fluctuations on the 
CMB is called the integrated Sachs-Wolfe effect and is described 
by 

rvo 
Tisw(ri) = 2 / *' (77, (r?o - J?) fi) drj, (1) 

''vis 

where 77 denotes the conformal time, 7715 and 770 the conformal time 
at last scattering and the present epoch, respectively, and n is the 
direction on the sky. Note that the integral in the above equation 
has to be taken along the backwards ligh t cone. ^ is the gauge in- 
variant Bardeen potential ( lBardeenll98(]f) . which coincides with the 
Newtonian gravitational potential in the Newtonian gauge used in 
this work. The prime denotes the derivative with respect to confor- 
mal time. In order to keep the notation simple we have redefined 
Tisw = (Tisw - To) /To, where Tq = 2.725 K is the temperature 
of the CMB monopole. We will use this convention as well for T 
and Tprim, which will be defined in section[3] 

In Newtonian gauge, Tisw is obtained by applying a linear 
operator V to the present matter density contrast 5m,(??o): 



Tisw = VSmirjo)- 



(2) 



The matter density contrast is defined as 5m{x) = 
[pm{x) — pm] jpra, where pm{x) deuotes the density of 
matter in the Universe at position x, and pm is the background 
matter density. Eq. ^ can be verified by using the perturbation 
equati ons derived by e.g. iKodama & Sasakil ( 119841) or Durren 
( l200lh . 

In order to obtain the expression for the operator V in the 
subhorizon-limit, let us look at the Poisson equation 

A*=^(l + 2)fim<5„, (3) 

where Ho is the present value of the Hubble constant, fi™ = 
pm.o/pcrit.o the present ratio of matter density to critical density, 
z denotes the redshift, and A is the Laplace operator in comoving 
coordinates. From the Poisson equation we obtain 
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*'(fc,^) = 



2fc2 



-ff('7)(l-/W)©('7)5™(fc,'7o) 



(4) 



where A; stands for the absolute value of k, H{rj) is the Hubble 
constant at conformal time rj, f = dln5„i/dlna is the growth 
function, ©(j?) = 5m{k,ri)/5m.{k,rjo) denotes the linear growth 
factor, and we define Fourier transformed quantities by 



5m(fc,0) 



d xe" ' 5m{x, 0). 



(5) 



The expression for the operator V can then be obtained by Fourier 
transforming eq. ^ and inserting it into eq. (TJ. Note, though, that 
we have not used the subhorizon-limit in this work, for eq. (O is 
valid on superhorizon-scales as well.[j 



3 STOCHASTIC PROCESSES 

3.1 Realization of the matter distribution 

In the standard cosmology adopted in this work, there are different 
stochastic processes to be considered. For simplifying the notation, 
let us define 



G{x,C) 



1 



^|27rC 



cxp 



(-ix-'x) 



(6) 



to denote the probability density function of a Gaussian distributed 
vector X with zero mean, given the cosmological parameters p and 
the covariance matrix C = (xx^)' where the averages are taken 
over the Gaussian distribution G{x, C). Note that in general the 
covariance matrix depends on the cosmological parameters, which 
is not explicitly stated in our notation. A daggered vector or matrix 
denotes its transposed and complex conjugated version, as usual. 
Hence, given two vectors a and h, a h^ must be read as the tensor 
product, whereas a* b denotes the scalar product. Note that these 
conventions can still be used for vectors and matrices in function- 
spaces, like, e.g., the matter overdensity field Sm, which is a con- 
tinuous function of the position x. 

During inflation, the matter density perturbations have been 
created from qua ntum fluctuations . This stochastic process was 
close to Gaussian ( Mukhanovll2005l) . permitting to write down the 
probability distribution for the matter density contrast given the 
cosmological parameters p as 



Pi5m\p)=G{5m,S), 



(7) 



where the covariance matrix S = {Sm.Sln) p(^s„^ | p)> depends on the 
cosmological parameters p. The average {■■)p(6„^ \ p) is defined as 
ensemble average over the different realizations of 5m, the index 

^ The correct formula for "P in Newtonian gauge, which also holds on 
superhorizon-scales, can be obtained by differentiating and Fourier trans- 
forming 

*(fc,r;) = exp - / p{k,r)') dr)' ^ ' ° " 



2Ha2 



xX){fc,V)5m(fc,r)o)cxp I / pik,v")dri" \dri' 



and inserting it into eq. (T), instead of the expression for 'I'(fc, rj) in the 

+ 3a^f 
'SaH 



subhorizon-limit, eq. (4). Here, we have defined p{k, rfj = — ^!— 2 



and the linear growth factor ©(fc, T]) = / ■ '^^' , which in general depends 
on the Fourier mode k. 



P(Sm I p) explicitly states which probability distribution the aver- 
age has to be taken over. Given homogeneity and isotropy, we note 
that the Fourier transformation of S is diagonal 



{Sm{k)Sm{k'y)p^s^ I p) = {27TfS{k - k')P{k), 



(8) 



where P{k) is the power spectrum, 5{..) denotes the Dirac delta 
function, and the star is used for denoting complex conjugation. 

The stochastic process due to the inflationary quantum fluc- 
tuations created the angular fluctuations in the CMB, that is, the 
primordial fluctuations originating from the surface of last scatter- 
ing at redshift z — 1100, as well as the integrated Sachs-Wolfe 
effect imprinted by the more local matter distribution at 2 < 2. 
Throughout this work we will assume that the primordial fluctua- 
tions and the ISW are stochastically independent, which is a safe 
assumption (apart from the very large scales), given that they are 
associated with matt er perturbations of very different wavelengths 
JBoughn et alJI 19980 , so that very little intrinsic cross-correlation 
can be expected. In fact, for notational convenience we will use the 
symbol 5m to only denote the local matter distribution at 2 < 2. 
The joint probability distribution for Tisw ~ P5m and the primor- 
dial temperature fluctuations Tprim then factorizes 



P{TlSW,Tprim\p) = P(riSW \p)PiTprim\p), 

with 

-P(7isw \p) = G(risvv, Cisw), 
and 

J^ \J^ prim \ P ) — t.Jl_-iprim,^primJ, 



(9) 



(10) 



(11) 



where we have defined the angular two-point auto-correlation func- 
tion for the fluctuation Tx (X being TSW' or 'prim') 



Cx = {TxT^)p(Tx 



P)- 



(12) 



Again, given homogeneity and isotropy, Cx is diagonal in spheri- 
cal harmonics space 



{(llmO'l'm')p(Tx \p) — Gl Sill 5„ 



(13) 



where Cf is the angular power spectrum of the quantity X, and 
we have used the expansion coefficients of Tx into spherical har- 
monics Ylm, 



X 

0-lm 



dVlTx{n)Yim{n) 



(14) 



where the integral is taken over the sphere. Given that the joint dis- 
tribution P{Tisw, Tprim | p) factorlzcs into two Gaussian distribu- 
tions, the sum T — Tisw + Tprim, which denotes the temperature 
fluctuation of the CMB, is again Gaussian distributed 



P{T\p) = G{T,Ccmb), 
with 



Gci\ 



GlSW + Cprin 



(15) 



(16) 



Given the cosmological parameters, the angular power spectra 
Cp^^, Cp^ and Cf"™ can all be calculated using CMBFAST 
(http : //ascl . net/cmbf ast . html . ISel iak & Zaldarriaga] 
l ll996l) ). CAMB (http : / /camb . i nf o.lLewis et aU ( i200Q) ), or 
CMBEASY (www . cmbeasy . org. lPorai] ( l2005h ). In particular, 



Cisw can be obtained from the three-dimensional matter covari- 
ance matrix S by 



Cis 



rsv 



(17) 
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where we have used that linear transformations of Gaussian random 
variables are again Gaussian distributed, with the covariance matrix 
transformed accordingly (see also iCoorayii2002i) . 



3.2 CMB detector noise 

From CMB detectors, we do not read off the real T as defined in 
the last section, but a temperature where the detector noise Tdet 
has been added. Again this can be modeled as a Gaussian random 
process. 



-P(rdct) — G(rdct, Cdct), 



(18) 



where Cdot denotes the detector noise covariance. This process is 
independent of the process that created the real (noiseless) T, such 
that if we redefine T ^T -\- Tdet to be the temperature we read off 
our detector, we obtain 



p(r|p) = G(r,CcMB + Cd 



(19) 



with CcMB being the covariance matrix of the real (noiseless) 
CMB. 

However, in most of this work we will neglect the detector 
noise, for the ISW is only present on the largest angul ar scales, 
wher e the dominant source of noise is cosmic variance dAfshordil 
|2004). The only part where we include the detector noise will be in 
section[6l where we derive the joint likelihood for the cosmological 
parameters, given CMB and LSS data, for in this likelihood we also 
include smaller angular scales. 



3.3 Shot noise 

Unfortunately, the matter distribution is not directly known, and 
we have to rely on LSS catalogues from which we can try to re- 
construct it. A process to be considered when working with such 
catalogues is the stochastic distribution of the galaxies, which only 
on average follows the matter distribution. Since the galaxies are 
discrete sources from which we want to infer the properties of the 
underlying matter overdensity field, we have to deal with shot noise 
in the galaxy distribution. More specifically, we assume the ob- 
served number Ng{xi) of galaxies in a volume element AV{xi) 
at a discrete position Xi to be distributed according to a Poisson 
distribution 



P{N,{xi)\\{xi)) 



KxiY 



N.ixiY. 



(20) 



Here, \{x) denotes the expected mean number of observed galax- 
ies within AV{x), given the matter density contrast. 



A(x) 



{x)nlAV [l + bSm{x) 



In the above equation, nj 



n: 



(21) 
^ /V denotes the cosmic mean 



galaxy density, with N^' '°* being the total number of galax- 
ies in the volume V. Note that we have added an index V to 
stress that these are the actual (real) number of galaxies present 
in AV^, not the observed number of galaxies Ng, which can be 
smaller due to observational detection limits. The window w{x) = 
$(a;) m{n) denotes the combined selection function <I>(a;) and sky 
mask m{n) of the survey, and b the galaxy bias, which in gen- 
eral depends on redshift, scale and galaxy type. The variance in 
the observed number of galaxies Ng{x) within AV{x) is then 
Ug{x) = {[Ng{x) — \{x)] )jVg = X{x). Here, we have used the 
index Ng on the average to indicate the average over the Poisson 
distribution in eq. l l20t . 



If the average number of galaxies A(a;) is large, the Pois- 
son distribution is well approximated by a Gaussian distribution 
around \{x). For simplicity we will use the Gaussian approxima- 
tion throughout this work. Furthermore we will ignore the depen- 
dence of the noise on Sm{x) by using Ug{x) — w{x) nl AV in- 
stead of the correct noise term crg{x) — X{x), for the latter would 
require a non-linear and iterative approach. Such an approach is 
beyond the scope of this paper, but is also irrelevant for the main 
find ing of this work. H owever, see Kitaura et al. (in preparation) 
and EnBlin et alj ( 120081) for a better handling of the Poisson noise 
and bias variations. 

Since the cosmic mean galaxy density rig is not known, we 
have to estimate it from the observed galaxy counts by 



n]jAV 



Nl 



^^"-(-0' 



(22) 



where Ng°^ is the total number of observed galaxies and the sum 
goes over all the pixels in our volume. 

With the above-mentioned simplifications, we can now work 
with the following linear data model. First we define the the ob- 
served galaxy density contrast at position x to be 



_ Ng{x)-w{x)nlAV 



(23) 



which is the convention used in Kitaura et al. (in preparation). 
Note that this definition differs from the one usuall y used in cross- 
corre lation studies by a factor of ■w{x) (see e.g. IPogosian et al.l 
I2OO5I) . We then write 



5g — R5m + (i, 



(24) 



where e{x) is the additive noise-term that originates in the Poisso- 
nian distribution of Ng{x), and R is the linear response operator 
In the simplest case, R{xi, Xj) = hw{xi) 5ij, but in general R 
maps the the continuous space in which Sm lives onto the discrete 
pixel space of our data 5g, and it can also include the mapping from 
redshift-space onto comoving coordinate space. 

Gravitational lensing introduces a magnificati on bias in the 
observ ed galaxy density contrast, as described by iLoverde et al.l 
1 I2OO7I) . In our data model, it is straightforward to take this effect 
into account by letting 

R5m{n,z) = 'w{h,z)[b5m{r{z)h,z)+'iQ.mHQ{2.^s{z) -1) 



r{z'){r{z) ~ r(z')) 



dz- 

H{z') r{z) 

(1 + z')5m{r{z)n,z')], 



(25) 



where r{z) is the comoving distance corresponding to redshift 2, 
and the slope s of the number count of the source galaxies is defined 
as 



dlogioJV(<m) 
dm 



(26) 



with m being the limiting magnitude and N(< m) being the count 
of objects brighter than m. Note that in order to get the correct 
formula for the magnification bias term in 3 dimensions, we used 
th e Dirac delta fu nction as the normalized selection function used 
bv lLoverde et all W{z, z') = 5{z - z'). 

From the Poisson distribution in eq. l l20t . we see that 
{Sg)Ng ~ RSm, and hence with the above simplifications the noise 
e is Gaussian distributed around zero 



P{e\p) = G{e,N), 



(27) 
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with the noise covariance matrix 

N{xi,Xi) = {e{xi)e{xi))^^ = J^ <5,,. (28) 



4 STANDARD CROSS-CORRELATION METHOD 

4.1 Description 

In this section, we briefly review the standard method for detecting 
the cross-correlation of the CMB wit h the projected galaxy density 
contrast, which was first describ ed by Boughn et al. ( 1998) , but s ee 
for example also lHo et alj ( I2008.) and .Giannantonio et alj ( 120081) ). 
Note that we use the word galaxy density contrast for convenience, 
but the method is of course the same when working with other trac- 
ers of the LSS as mentioned in section[T] 

The theoretical cross-correlation function of two quantities 
X(n) and Y{n) on the sky is defined in spherical harmonics space 
as 



L-i = yO'lmO-lm /all- 



(29) 



The average in the above definition is an ensemble average over all 
possible realizations of the universe with given cosmological pa- 
rameters, i.e. over P{Sm, Sg,T \ p). This is indicated by the index 
'air on the average. We will denote the abstract cross-correlation 
function as a vector in Hilbert space by ^^'^ to simplify the no- 
tation. This can be understood as a vector in pixel space or as a 
vector in ajm-space. Only when evaluating the expressions we de- 
rive, we will choose the representation of the abstract vector ^^'^ 
in spherical harmonics space, (^"^'^ );„;/„/ = C, ' 5;;'(5mm/. In 
the following we will work with the cross-correlation function of 
the projected galaxy density contrast with the CMB temperature 
fluctuations, ^^' '-^^^, in order to reproduce the standard approach 
in the literature. 

The observed projected galaxy density contrast 5j"^°^ for a 
redshift bin centered around redshift Zi in a given direction n on 
the sky is 



Sr'in,z,) 



dzW(z,Zi)5g(n,z) 



= / dzW{z, Zi)[RS,nin, z) + e{n,z)], 



(30) 



where W(z,Zi) denotes the normalized selection function that de- 
fines the ith bin, and Sg is given by eq. ( 1231 ). Note that in many 
cross-correlation studies the normalized selection function "I>(a;) 
of the survey is used to define the bin. However, since later on we 
will consider a perfect galaxy survey covering all the redshift range 
relevant for the ISW, we need to introduce the additional narrow 
selection function W{z, Zi) defining the bin. 

If the LSS survey and the CMB map cover the full sky, it is 
convenient to define an estimator for the cross-correlation function 
of the projected g alaxy de nsity contrast with the CMB in spherical 
harmonics space (Rassatet al. 2006), 



C, 



a, CMB 



21 + 1^^ 



Re\ 



"■lm°-l-n 



CMB . 



(31) 



where a^^ and af^^ are the expansion coefficients of the ob- 
served 5g''°^ and T into spherical harmonics as defined in eq. 
( 114b . The hat has been added to discriminate the estimator of the 
cross-correlation function from its theoretical counterpart Cf' 
In the case that the experiments cover only a part of the sky. 



one has to take into account the effects of mode-coupling when 
working in spherical harmonics space. In this case it is there- 
fore more straightforward to define other estimators for the cross- 
correlati on function, such as averag es over the sphere in real space 
(see e.g. Giaimant o nio et alj 120081) or quadratic estimators as in 
lAfshordi et alj (g004). However, for the statement we will make in 
this work the actual definition of the estimator is not relevant, and 
we find the one defined in spherical harmonics space the most con- 
venient to work with, since a closely related quantity also appears 
within the framework of the optimal detection method presented 
later on in section [5] Again we use the abstract notation ^b-CTshb. 
for the estimator of the cross-correlation ^^ cmb ^^ order to keep 
the notation simple, we will from now on understand ^s-CyiB. ^^^ 
^gCMB ^ ijeing vectors in spherical harmonics-space as well as 
in bin-space, containing the cross-correlation functions for all the 
different bins. 

In the literature, the probability distribution of the above- 
defined estimator ^S'*^^^ around the theoretical cross-correlation 
function ^s.cms j^ usually approximated by a Gaussian, 



'(r-^-«b)=G(e 



9, CMB 



< 



9, CMB 



c 



(32) 



where the covariance matrix of the cross-correlation estimator is 
defined as 

S.CMBi \ /^- ^»iT, ^^ ^»,T, \T 



c 



^|^|9.CMB_^^9.CMB^^^^^)|^^9.CMB_^^9.CMB^^^^^ >all. (33) 

The first question usually addressed in the above-mentioned 
cross-correlation studies is whether a non-zero cross-correlation 
function can be detected at all. To this end one assumes a fidu- 
cial cosmological model, which is used to predict the theoreti- 
cal cross-correlation function and covariance matrix C. In this 
wo rk we use the flat ACDM model with parameter values given 
by iKomatsu et alj J2008l) . table 1: fib/i^ = 0.02265, Oa = 
0.721, h = 0.701, Us = 0.96, r = 0.084, ag = 0.817." The 
covarianc e matrix is usually estimated by Monte Carlo simula- 
tions (see Cabre et al. (2007!) for an overview), or analytically as 
in Afshordi et al. (2004). The analytical prediction is possible in 
the case that the joint probability distribution for the projected 
galaxy density contrast and CMB given the cosmological param- 
eters, P[5^l°\5^j°\TcMB. I p), is Gaussian, which is valid in the 
framework of linear perturbation theory. Here we have used the 
index gi to denote the projected galaxy density contrast of bin i. 
Then the covariance matrix in spherical harmonics space can be 
expressed in terms of two-point correlation functions as 

J r^9i,CMB^9J,CMB ^9i,9j ^^CMB] 



Cl{l,j) = 



{21 + l)/sky 



(34) 



where we have used the auto-correlation power spectrum for the 
CMB, as defined in eq. ( |13b . Cf '^"' contains by definition the 
power coming from the underlying matter distribution plus the shot 
noise. Note that, in principle, cf^^ in the above formula also in- 
cludes detector noise, which we neglect here as discussed in sec- 
tion l3.2l /sky is the fraction of the sky covered by both, the galaxy 
survey and the CMB experiment. In the following we will assume 

/.ky = 1. 

Putting an amplitude or fudge factor Ace in front of the the- 
oretical cross-correlation function ^s-'^^'^ j^y j^^jj^j^ Qjjg ^^^ jjq^ 
find out whether it is possible to detect a non-zero Ace- The index 
'cc' on the amplitude indicates that it is the amplitude of the cross- 
correlation function. Of course this amplitude should be one in the 
fiducial model. However, even if the data are taken from a universe 
in which the underlying cosmology is the fiducial model we will in 
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general not estimate the amplitude to be one. This is due to the dif- 
ferent sources of stochastical uncertainty or noise in the estimate of 
Ace, which we have described at length in section[3l The likelihood 
for the amplitude given the cosmological parameters reads 



'(f 



9,CMB 



Ac 



)-a{t 



,P = 



9,CMB 



^cc S 



9,CMB 



c 



(35) 



A commonly used estimator of the amplitude A^c is the maximum 
likelihood amplitude 



Ac 



e 



g.CMB t 



c-^e 



9,CMB 



£9,CMBt (7-1 £9,CMB 



-igj.CMB ' 



(36) 



where in the second line we have used the representation of the 
cross-correlation functions in spherical harmonics space. The max- 
imum likelihood amplitude is an unbiased estimator (if the under- 
lying probability distribution is Gaussian), hence for the fiducial 
model we have for the average over all cosmic realizations 



\^cc}a 



1, 



(37) 



-,91, CMB 



since (Cf*' )aii ~ Cf'''^'"'° by definition of the latter quan- 

tity. Note that here we have assumed that the data are taken in a 
universe where the underlying cosmology is actually the fiducial 
model. This will be assumed in the rest of the paper as well. 
The variance in Ace is given by 






{Acc)all 1 )all 



9,CMBt i~i-l ^9,CMB\ 



5](2; + i)^cr'^-Vr\M-)q 



9J,CMB 



(38) 



In the standard literature, an estimated significance is given to 
the detection of the amplitude, the estimated signal-to-noise ratio 



^cc 
Cfcc 

(39) 



-^9.7, CMB 



E.(2« + i)E„-c^r^''"c7r^(M-)c,' 



However, since the real signal is Ace = 1, the actual signal-to-noise 
ratio is given by 



\N J cc Cfcc 



(40) 



/^(2; + i)^cr'""Vr^(M-)q 



9j,CMB 



and is therefore independent of the data. 



4.2 Analysis of error-contributions 

In this section we analyse the different sources of noise that con- 
tribute to the total variance in eq. {38}. In order to simplify this task 
we assume that there is no shot noise in the galaxy distribution, that 
is, we set e = in eq. l l24b . which means that the galaxies trace the 
matter distribution perfectly. Furthermore we work with the ideal 
case that we have a galaxy survey that covers the whole sky and 



goes out to redshift 2. With these two assumptions we have a per- 
fect knowledge of the matter distribution Sm relevant for the ISW 
effect. 

For sufficiently narrow bins, the integration kernels for ISW 
and galaxy density contrast are approximately constant over the bin 



and hence a 



ISW(i) 
I'm 



= const (i) X af^. In eqs OU, ^, and l [40l (. 
we can therefore substitute every index gi by the index ISW(z), for 
the constant factor cancels out. Now, if one uses the ISW kernel, 
working with several narrow bins that cover the whole volume rel- 
evant for the ISW effect, is equivalent to working with only one 
bin covering the same volume. This is because the ISW integrated 
over the whole relevant volume is exactly the information about the 
ISW contained in the CMB. Hence one does not gain anything by 
working with bins if using the correct kernel. We outline the proof 
for that in AppendixlAl In what follows, we therefore consider only 
one bin, which significantly simplifies the form of eqs ( |36b , J38b , 
and(l40]l. 

Furthermore we note that, since the ISW is uncorrelated with 



the primordial CMB fluctuations, we have C, 



ISW, CMB 



C' 



The index 'all' now indicates an average over the probability distri- 
bution P(risw, Tprim 1 p) = P(Tisw I p)P(Tprim | p) (cf. sectiou 
O- Under the above assumptions, eq. ( |36b for the estimated ampli- 
tude reads 



Ac 



E,(2' + l)< 



:^ISW,CMB 



'+0^ 



Z^t^ + IJ (jlgW_|_0CMB 



with the variance (ea.l38t 

C(-;,ISW \ 

/ ,(2' + l)^isw , ^CMB I ' 

and the signal-to-noise ratio in eq. l l40t simplifies to 



(l)„-iD^'-" 



(^ISW 



(^ISW I (^CMB ■ 



(41) 



(42) 



(43) 



The signal-to-noise ratio as a function of the maximum summation 
index Zmax for our fiducial model is depicted in the top panel of 
Fig- El for which we have modified CMBEASY in order to obtain 
Ci and C; . There are contributions to the signal-to-noise up 
to roughly I = 100. Note, though, that our assumptions of Gaus- 
sianity of the matter realization 5,n and the assumption of V being 
a linear operator do not hold on small scales where structure growth 
has become non-linear. However, this issue will not be addressed in 
this work and it will not affect our main results, which are due to 
advantages of our method on the very large scales, which are most 
affected by cosmic variance. 

The above estimator for the amplitude is only unbiased when 
averaging over the joint distribution 



(j4cc)all = ((^cc)prim)lSW = 1- 



(44) 



Here we indicate averages over P(Tprim \p) and P{Tisw \p) by 
the indices 'prim' and 'ISW, respectively. This means that both 
the primordial CMB fluctuations and the realization of the local 
matter distribution are included in the error budget. We call the lat- 
ter the local variance, indicating that it originates in the realization 
of the matter distribution in our observed Universe. Let us now es- 
timate the contribution of the local variance to the total variance of 
Ace- To this end we split the variance in eq. i42t into two parts 



(((^cc- 



1 >pH.n)l 
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\\i -^cc ~ (^cc/prim I /prim/ISW 



+ {( (Ace) prin, - l) )l 
2 I 2 



(45) 



where we have defined the contributions to the variance coming 
from primordial CMB fluctuations, and the local variance as o-prim 
and af^^, respectively. Both can be easily calculated, and the second 
contribution turns out to be 



E,(2; + i) 



{cj^-^r 



0-1 



(o? 



2_];(2/ + 1) pCMllj_rriaw 



(46) 



(jCMii_|_piaw ) 

In the bottom panel of Fig. [T] we plot the relative contribution of 
the local to the total variance af^^/a^,, against the maximum I that 
we consider in the analysis, for our fiducial cosmological model. 
For a maximum multipole Imax ~ 100, this relative contribution 
amounts to 



11%. 



(47) 



This estimate agrees with lCabre et al.l 1 1200 7) . who compare differ- 
ent error estimates for the standard cross-correlation method. They 
compare what they call the MCI method, which only takes into 
account the variance in the CMB and ignores the variance in the 
galaxy overdensity, with the MC2 method, which includes also the 
variance in the galaxy overdensity. Both methods rely on perform- 
ing Monte Carlo (MC) simulations of the CMB, and of the galaxy 
overdensity in the case of MC2, and the simulations used to com- 
pare the different error estimates have converged with an accuracy 
of about 5 per cent, as stated in the paper. The result is that, com- 
pared to the MC2 method, the MCI method underestimates the er- 
ror by about 10 per cent, which agrees well with our estimate. 



5 OPTIMAL METHOD 

Since the expected ISW effect is known from our galaxy survey, it 
is possible to find a cross-correlation estimator that does not include 
the local variance in the error-budget, but is unbiased already when 
averaging conditional on the observed galaxy density contrast 5g. 
We will introduce such an estimator in this section. To this end, we 
first derive the posterior distribution P{T \Sg,p) for the CMB tem- 
perature T, given the galaxy data 5g and the cosmological parame- 
ters p. From that we obtain the maximum likelihood estimator for 
the amplitude of the part of Tisw which is known from the galaxy 
survey. Since we assume everything to be Gaussian distributed, this 
maximum likelihood estimator is equivalent to the estimator we ob- 
tain from an optimal matched filter approach. 

Note that a different attempt to make the detection of the ISW 
unbiased by the realization of the local matter distribution was done 
bv lZhand j2006an . It involves comparing CMB-galaxy and lensing- 
galaxy cross-correlation functions, and hence relies on nowadays 
still-difficult lensing measurements. 

Another work which does not suffer from local variance 
is by Hernandez-Monteagudo (2008). He implements an optimal 
matched filter in spherical harmonics space, and finds by numeri- 
cal comparison that it always perf orms better than or equal l y wel l 
as the standard method. However, iHemandez-Monteagudd J2008r) 
works directly on the sphere, without using a Wiener filter recon- 
struction of the LSS, and therefore is slightly suboptimal in ex- 
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Figure 1. Comparison of the average signal-to-noise ratio and variance of 
the optimal method with the ones of the standard method for Zmax = 2. 
Top panel: Average signal-to-noise ratio of the optimal method (solid) and 
signal-to-noise ratio of the standard method (dashed) versus the maximal 
multipole considered in the analysis. Middle panel: Relative improvement 
of the average signal-to-noise ratio in the optimal method. Bottom panel: 
Average relative improvement of the variance in the optimal method (solid) 
and relative contribution of the local variance to the total variance in the 
standai'd method (dashed) 



ploiting the available three-dimensional information on galaxy po- 
sitions. 

In this work, we go one step further than 
iHemandez-Monteagudd i2008l) and derive the joint likeli- 
hood for cosmological parameters, given CMB and LSS data, 
which includes the small coupling of the two datasets introduced 
by the ISW effect (cf. section[6). 



5.1 Derivation of the posterior distribution 

Let us first ask the question what the observed galaxy density con- 
trast tells us about the matter distribution 5m ■ Given the data model 
in eq. l l24t and the noise distribution in eq. l l27t . we know that 



Pi5g\5m,p) 



P{Sg ~ RSm\ 5,n,p) 

G{5g-RSm,N). 



(48) 



Using the probability distribution for Sm given in eq. l|7) as a prior 
for 5,n, we obtain the joint probability distribution P{5g,Sm \ p) = 
P{5g I 5m.,p) P{5m I p) of galaxies and matter distribution 

P{Sg,5m\p) = G{5g-R5m,N)G{5m,S) 

= G{5m~Dj,D)G{5g,RSR' + N), (49) 

where we have defined the matter distribution uncertainty covari- 
ance matrix 

D= (R^N'^R + S'^y^ 



and the response over noise weighted galaxy overdensity 
j = R^N-'Sg, 



(50) 



(51) 



which can be interpreted as the information source of our LSS 
knowledge l lEnfilin et al. 2008). A detailed derivation of the expres- 
sion for the joint probability distribution in eq. l |49l l can be found in 
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Appendix IBI This distribution can be trivially integrated over 5m in 
order to obtain the evidence 

P{5g\p) = G{5„RSR'' + N). (52) 

Therefore the posterior distribution P{5m\5g,p) = 

P(5™,5sip)/P(<59|p)reads 

P{5m \5g,p) = G{5m - Dj, D). (53) 

From this posterior one can directly read off the maximum a- 
posteriori estimator for the matter distribution 8m 



and estimate the amplitude by a maximum likelihood estimator 



Dj = [R^N-^R + S-^)-^R^N-^5g 



(54) 



This is the Wiener filter applied to the galaxy-overdensity 
(Wiener 1950; Zaroubi et al. 1995; Zaroubi 1995; Fisher etal. 
ri995l ; lErdogdu et al.l2004lKitaura & Enfilinl2008l) . We call this es- 
timator a reconstruction of the matter distribution from the galaxy 
survey, hence the symbol 5]^'^ . 

With this knowledge of the matter distribution, let us now find 
the posterior distribution for T — Tisw + Tprim + Tdet, given 
the observed galaxy density contrast 5g. The probability distribu- 
tion for Tisw, obtained from the one for 5m, eq. ( I53t . (note that we 
again use that linear transformations of Gaussian distributed ran- 
dom vectors are again Gaussian distributed; cf. eq. (TTJl), reads 



P(risw \Sg,p) = G(Tisw - r,VDP''). 
where we have defined the ISW template 

T = P5m ■ 



(55) 



(56) 



Since the uncertainty in the reconstructed matter distribution 
is not related to the primordial CMB fluctuations (cf. section [JH ), 
the joint probability distribution for Tisw, Tprim, and Tdct given 



5g factorizes: 



PiTii 



,Taet\5g,p) = PiTisw\5g,p)P{Tprin,\p) 

P(rdct|p). (57) 



Note that in the above equation we have used the fact that the pri- 
mordial CMB fluctuations do not depend on the galaxy distribution. 
We now again use the fact that the sum of stochastically indepen- 
dent Gaussian distributed random variables is again Gaussian dis- 
tributed with the sum of the covariance matrices. We then obtain 
the posterior distribution for T, given the LSS data: 



PiT\5g,p) = G{T~r,C). 



(58) 



Here we have used the probability distributions for Tprim and Tdct, 
eqs dllt and l |18l ), and we have defined the covariance matrix for 
the total noise 

C = PDP'' + Cprim + Cdct . (59) 

As in section [4] we will neglect the detector noise in the rest 
of this section and only include it when deriving the likelihood in 
section |6l However, if needed it can easily be included into the 
following equations by replacing Cprim -^ Cprim + Cdot. 



5.2 Estimation of the ISW amplitude 

We can now ask the same question as before, namely if it is at all 
possible to detect a non-zero amplitude At that we put in front of 
our ISW template in eq. J58t . Again we can write down the likeli- 
hood function for the amplitude 



T^t /^-i_ y",(2« + l)-S; 



Ci 



^'"^-'^ E.i^l+lf^ 



(61) 



where we have defined the estimator CI of the ISW auto- 
correlation function analogous to the cross-correlation estimator in 
eq. linj: 



C[ 



1 \"^ I r |2 

\0.lm\ ■ 



21 + 1^-^' 



(62) 



This maximum likelihood amplitude is again an unbiased estimator, 
but now with respect to the probability distribution conditional on 



X-'T-T/cond — -L, 



(63) 



where the index 'cond' on the average denotes an average over the 
distribution P(Tcmb | Ar,5g,p). 

In other words, we have eliminated the noise component com- 
ing from the realization of the known part of 5m, thus reducing 
the sources of noise to the unknown part of 5m and the primordial 
CMB fluctuations. The variance in At is 

O't = \\^r ^ (^r) cond J /cond 

= (rtC-V)-'=(^(2/-H):^| , (64) 

and we obtain the signal-to-noise ratio 



a\ 



/^(2/ + l)| 



(65) 



P{T\AT,5g,p} = G{T-ArT,G), 



(60) 



Note that the error estimate (and hence the signal-to-noise ratio) 
of the optimal method depends on the concrete LSS realization, 
and how well it is suited to detect the ISW effect. In a universe, 
where by chance the local LSS does/does not permit a good ISW 
detection, the error is small/large, as it should be. 

We would like to point out that in the optimal method there is 
no need to estimate the covariance matrices from Monte Carlo sim- 
ulations, since for a given set of cosmological parameters, the mat- 
ter covariance matrix (power spectrum) S can be calculated analyt- 
ically using the fitting formula provided bv Bardeen et al. ( 19860 , 
since it is still linear on the scales we are interested in. Cprim can be 
obtained from Boltzmann codes such as CMB EASY, and the noise 
covariance A*' can be estimated from the data. 



5.3 Comparison of signal-to-noise ratios and biasing 

In order to compare our method to the standard one, let us now 
again make the simplifying assumption that there is no shot noise 
in the galaxy distribution, and that we have a perfect galaxy survey, 
as we did in section |42l At the end of this section, we will also ap- 
proximately look at the effects of a galaxy survey that is incomplete 
in redshift, i.e. that goes out to a maximal redshift Zmax < 2. For 
the perfect survey, the shot noise covariance matrix A'^ is zero, and 
hence the posterior for 5m in eq. l |53| l is infinitely sharply peaked 
around the reconstruction Sm'^ (eq.l54t. which turns into 

C = {R^N-^R)-^R^N-^5g 

= R'^5g. (66) 
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Figure 2. Probability distribution of the signal-to-noise ratio in the optimal 
method (solid) for imax = 100 and Zmax = 2. The vertical line (dashed) 
shows the signal-to-noise ratio of the standard method for comparison. 



Figure 3. Average signal-to-noise ratio of the optimal method (solid) and 
signal-to-noise ratio of the standard method (dashed) versus Zmax for 
imax = 100. 



Here, R ^ should be read as the pseudo-inverse of fl, e.g. as de- 
fined in terms of Singular Valu e Decomposition (see IPress et al.l 
( [1992) and Zaroubi et all ( Il995l) ). 

The posterior for Sm. in eq. ( 153b is therefore now a Dirac delta 
function 



Pi5m\Sg,p)=S{Sm-R^^Sg), 



(67) 



which makes our ISW template exact, and the noise covariance ma- 
trix due to the error in the reconstruction is zero, VDV^ — 0, 
hence leaving us with C — Cprim ~ Ccmb — Cisw. Since our 
perfect LSS survey covers the complete volume relevant for the 
ISW, our template is now equal to the ISW-temperature fluctua- 
tions, r — Tisw- We can then substitute all indices r in eqs l |61| )- 
( 165 1 by the index ISW, and the estimated amplitude becomes 



Ar^ 



E,(2« + i) 



rSW.CMB 

TSVT 






(68) 



with the variance 



al={Y^{2l + l)^ 



_ (jlSW 



and the signal-to-noise ratio 



(#),-!»' + « 



(^ISW 



(^CMB 



(jlSW ■ 



(69) 



(70) 



As we mentioned before, the variance, and hence the signal- 
to-noise ratio of the optimal method, depend on the actual real- 
ization of the matter distribution in our observed Universe. In Fig. 
[21 we plot the probability distribution of our signal-to-noise ratio 
for Zmax = 100 and Zmax = 2, which we have inferred from the 
distribution of Tisw using the central limit theorem for (S/N)^, 
and from that deriving the distribution for (S/N)^. We have also 
checked the validity of the central limit theorem in this case by 
comparing with the correct probability distribution of the signal- 
to-noise ratio g iven by an expansion into Laguerre po l ynom ials as 
derived e.g. in ICastaiio-Martinez & Lopez-Blazquej ( 120050 . The 
probability distribution is such that the signal-to-noise ratio can 
easily differ by A (S/N)^ ~ 1 for two different realizations of 
the matter distribution. 



The mean signal-to-noise ratio (S/N)'!^ = l/^cr?_av = 
l/-v/(cr|)isw increases with Zmax, as it did for the standard 
method. For every Imax we compare the mean signal-to-noise ratio 
of the optimal method to the signal-to-noise ratio of the standard 
method (cf. eq.|43ll in the top panel of Fig.[T] again for Zmax ~ 2. 
Note that in our formula for the signal-to-noise ratio, eq. jlOh there 
is now a minus sign between Ci^ and Cj , in contrast to the 
signal-to-noise ratio of the standard method in eq. ( 143 | l. which has 
a plus sign instead. Thus we take advantage of the LSS instead 
of moving it into the error budget. The absolute enhancement of 
the signal-to-noise ratio in our method is therefore independent of 
Zmax, for the main advantage of working conditional on the LSS 
arises on the very large scales, where the contribution of the ISW 
to the CMB is highest. The average relative improvement of the 
signal-to-noise is depicted in the middle panel of Fig.[T] It amounts 
to about 7 per cent for Zmax ~ 100. In the bottom panel of Fig. 



[T] we compare the mean relative improvement {a^ 



M< 



of the variance in the optimal method with the contribution of the 
local to the total variance in the standard method. The variance is 
reduced by about 13 per cent in the optimal method, as compared 
to the standard method. 

Note that the maximal average signal-to-noise ratio we can 
hope for when trying to detect the ISW via cross-correlation, given 
a perfect LSS survey, is {S/N)'^ ~ 7.3, with a variance as de- 
picted in Fig.|2| Hence, if we are lucky and live in an environment 
that allows for a high signal-to-noise ratio, we can maximally ob- 
tain a detection significance of about (7.5 — 8)a. 

Let us now look at the effect of an incomplete galaxy sur- 
vey. Incomplete galaxy surveys can be treated generically with our 
method, because the dark matter field, and hence the ISW effect, are 
split into a known part (the reconstruction) and an unknown part (an 
additive noise term uncorrelated with the reconstruction). However, 
for now we only want to give a rough estimate of the consequences 
of an incomplete survey. Therefore we introduce a sharp cut-off in 
redshift, Zmax, and we simply redefine Tisw = Tisw(< Zmax) to 
be the part of the ISW effect created at z < Zmax. The part of the 
ISW that has been created at 2 > 2;max is then considered part of 
the primordial temperature fluctuations Tprim. The power-spectra 
C; and Cf"'^ are redefined accordingly. With this redefinition 
we have introduced a correlation between what we consider the 
ISW and primordial fluctuations, which we would not have if we 
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had used the reconstruction for redefining Tisw- However, for get- 
ting the picture, we ignore this subtlety for the moment. 

In Fig. [3] we plot the signal-to-noise ratio of the standard 
method together with the average signal-to-noise ratio of the op- 
timal method versus 2;max for Zmax = 100, where we have used the 
above-described redefinition of C]^^ in eqs MQ\ and ([43}. With 
decreasing maximal redshift of the LSS survey, the total signal- 
to-noise ratio in both methods goes down, as does its relative en- 
hancement of the optimal method as compared to the standard one. 
Also the relative contribution of the local to the total variance in the 
standard method goes down with decreasing survey depth. 

As we stated in section l42l the amplitude-estimate of the stan- 
dard method is biased when the averaging is performed conditional 
on the galaxy-data 5g. This leads to an over- or underestimation 
of the detection significance, for the estimated amplitude is used 
when estimating the signal-to-noise ratio from the data. As we have 
shown, the contribution of the local to the total variance of the es- 
timator is quite small, about 1 1 per cent for an ideal galaxy survey 
and even smaller for a shallower survey. However, we could be un- 
lucky and live in an unlikely realization of the matter distribution, 
given the power spectrum, which would enhance the effect of the 
biasing. 

With the method we presented in this work, the local variance 
effect is reduced. If we knew the local matter distribution perfectly, 
we would not be affected by local variance at all, as we have shown. 
Unfortunately, we have to rely on reconstructions of the matter dis- 
tribution from LSS surveys, which suffer from shot noise, and the 
effects of mask and selection function. However, the reconstruction 
treats mask and selection function in an optimal way, and extracts 
the maximum amount of information from the LSS data which can 
then be used in the ISW detection. 



6 LIKELIHOOD FUNCTION FOR COSMOLOGICAL 
PARAMETERS 

The above-described biasing effect is of course also present when 
moving from the pure detection of the ISW to the task of constrain- 
ing cosmological parameters using the ISW, which has been at- 
tempted in many of the above-mentioned cross-correlation studies. 
This problem can already be seen in the likelihood function for the 
cosmological parameters of the standard method in eq. ( I32l (. The 
estimator of the cross-correlation function ^g, cmb could be quite 
different from the theoretical prediction with the underlying param- 
eter values, just because we are living in an unlikely realization of 
the matter distribution, given the power spectrum. Then the likeli- 
hood in eq. ([32} would favour cosmological parameter values for 
which the theoretical prediction of the cross-correlation function is 
closer to its estimator, hence biasing the parameter estimates. 

Furthermore, to our knowledge, there is no straightforward 
way of combining the likelihood from the cross-correlation in eq. 
( 132^ with the likelihoods for CMB and LSS data, as e.g. given 
bv Fverde et all J2003h , |Percival et alj (|2004J) and lCole et al.l l2005h . 
Usually, when combining CMB with LSS data for deriving con- 
straints on cosmological parameters, it is assumed that the two 
datasets are stochastically independent, i.e. that P(T, &„ 



P( T\p) P(Sg I p) [ cf. Teg mark et al. (2004), Sperg el et alj jlOOJi) 
and iKomatsu et all ( |2008|) 1. But the ISW effect (and also other 
effects as e.g. the Sunyaev-Zel'dovich-effect) introduces a small 
stochastical dependence of the CMB data on the LSS data. That is, 
instead of assuming that the joint likelihood factorizes, one should 
consider 



P{T,5,\p)^PiT\5g,p)P{5,\p), (71) 

in which we insert eq.s ([58} and ([52}, obtaining 
P{T,Sg\p) = GiT-T,C)G(Sg,RSR'' +N), 

= P{T\p)P{Sg\p)Q{T,5g\p), (72) 

and we recall for convenience the definition of C, eq. ( |59t , 

C = VDP'' +Cprim+Ci,t, 

of D, eq. ([50}, 

D= {r''N-^R + S-^)~\ 
and of T, eq. ([56}, 

T = P5„, . 

In the last step in eq. ([72}, we have expressed the joint likelihood in 
terms of the likelihoods P{T \ p) and P{5g \ p) for only CMB and 
only LSS data, respectively, and the coupling term 



Q{T,5g\p) 



P{T\5g,p) 

P{T\p) 
G{T - r, C) 



(73) 



G{T, Ccmb) 

Eq. l l72t is the generic expression for the joint likelihood 
P{T, Sg I p) for the cosmological parameters p, given CMB 
and LSS data, consistently including the small coupling term 
Q{T,5g \p) between the two datasets introduced by the ISW ef- 
fect. The quantities depending on the cosmological parameters are 
S, Cprim, P, R and, in general, A'^. Multiplying the likelihood 
by a prior P(p) for the cosmological parameters, one can then 
sample the parameter space and derive constraints on the cosmo- 
logical parameters from the posterior distribution P{p\T,5g) oc 

PiT,5g\p)P{p). 

Note that this likelihood function remains valid if galaxy bias 
variations, position dependent noise, and other non-linear effects of 
galaxy formation are taken into account, as long as the variance of 
the reconstructi on D = {{S'j^'^ — 5 m) (<5m'^ — 3m.) ) is estimated 
consistently (see lEnBlin et alj ( 120081) for methods to treat such com- 
plications). 



7 CONCLUSIONS 

Due to the obscuration by primordial CMB fluctuations, the detec- 
tion of the integrated Sachs-Wolfe effect remains a challenge, and 
has to be performed by cross-correlating the CMB signal with the 
large-scale structure. The standard method for doing so involves 
comparing the observed cross-correlation function to its theoreti- 
cal prediction, which is by construction an ensemble average over 
all realizations of the primordial CMB fluctuations and matter dis- 
tributions. Hence, the realization of the matter distribution in our 
Universe acts as a source of systematic noise in the estimate of the 
cross-correlation function, an effect that we have named the local 
variance. 

Since the ISW is only present on the largest scales, the effect 
of the local variance is quite notable, amounting to about 1 1 per 
cent of the total variance in the standard method for an ideal LSS 
survey. This leads to a biased estimated detection significance of the 
cross-correlation, and when moving from the pure ISW detection to 
parameter estimation, it also biases the parameter constraints. We 
note that even if the local variance contributes only about 1 1 per 
cent to the total variance of the detected signal, we could be unlucky 
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and live in an unlikely realization of the matter distribution, given 
the power spectrum. This would enhance the effect of the bias on 
the detection significance and parameter constraints. 

Given that information about the matter distribution can be 
inferred from the LSS survey, the local variance can be reduced 
by working conditional on this information. In this work, we have 
presented a generic technique of how to include the knowledge of 
the matter distribution into ISW detection in an optimal way, hence 
reducing the effect of the local variance. This optimal method re- 
quires a three-dimensional Wiener filter reconstruction of the LSS, 
including an estimator of the full reconstruction uncertainty co- 
variance matrix. Note that also other reconstruction techniques that 
provide an estimator of the uncertainty covariance can easily be in- 
cluded into our method. The reduction of the local variance stresses 
the importance to measure and reconstruct the LSS to the highest 
possible accuracy, as aimed by Kitaura & EnBlin (2008) and Ki- 
taura et al. (in preparation). 

The conditionality on the LSS data results in a dependence of 
the variance in the detected signal on the actual realization of the 
matter distribution in the observed Universe. The average variance 
in the optimal method is reduced by about 13 per cent as compared 
to the standard method, again in the case of an ideal LSS survey. 
The reduction of the noise translates into an average enhancement 
of the signal-to-noise or detection significance by about 7 per cent 
for the optimal method. However, note that also the signal-to-noise 
ratio depends on the actual realization of the matter distribution. 

We would also like to point out that in the optimal method, 
there is no need to estimate the covariance matrix by Monte 
Carlo simulations, which safes time and increases the accuracy of 
the method (using 1000 Monte Carlo simulations to estimate the 
standard covariance matrix of the cross-correlation f unction only 
reache s an accuracy of about 5 per cent, as stated bv lCabre et al.l 

ioo^)). 

In order to consistently include the information encoded in 
the ISW effect for deriving cosmological parameter constraints, 
we have derived the joint likelihood P{T, 5g \ p) for the cosmo- 
logical parameters p, given CMB and LSS data within the linear 
regime of structure formation. If one wishes to use the ISW effect 
for constraining cosmological parameters, one should include the 
additional CMB-galaxy data coupling term Q{T, Sg \p), which we 
have factored out in eq. l l72t . into the usual likelihood analysis. 
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APPENDIX A: PROOF OF THE EQUIVALENCE OF THE 
NUMBER OF BINS 

We now outline the proof that if one uses the correct kernel, i.e. 
the ISW kernel rather than the kernel for the galaxy density con- 
trast in the analysis, the estimated amplitude Ace and the variance 
aic are independent of the number of bins chosen, provided that all 
bins together cover the whole volume relevant for the ISW effect. 
The proof here is done only for the variance, but follows the same 
scheme for the estimated amplitude. The total variance a^^ one ob- 
tains when working with A*' bins is given by eq. l |38t , where we 
have substituted the index gi by ISW(i), following the argument of 
sectionl4!2l 



^(2Z + 1)^ cISW(.),CMB^-l(.^^.^^ISW(,).CMB 



.(Al) 



We then use the form of the covariance matrix given by eq. l l34t 
and the following relations that only hold for the ISW kernel: 



^ISW(i),CMB _ V^ ^ISW(i),ISW(j) 



C 



N 



ISW(j),CMB 



(A2) 



(A3) 



Now we choose a fixed but arbitrary number of bins A', invert the 
covariance matrix and by inserting the above relations we obtain 



V^ ^ISW(i),CMB^-l/ 
i-j 



-•■,m 



ISW(i),CMB 



(^ISW _|_ (^CMB 



.(A4) 



Inserting this into eq. JAlb . the resulting formula for a^c is exactly 
what we obtain from one single bin covering the whole volume 
relevant for the ISW effect. We have checked this explicitly for 
N = 2.. 5 and it is straightforward, though timely, to also check it 
for any other number of bins. 



xexp(--Si^S ^S„ 



with 

P{Sg,5^\p) = GiS,~RS^,N)G{S^,S) 
_ 1 

y/\2TvN\\2nS\ 

X exp (-^(^9 -RSr^yN-^S, - E<5™)) 

(Bl) 
Let us first rewrite the exponent 

= Sl D-^5m - 2 jt 5,„ + 5l AT-i 5g 

= {5^ - Dj)^D-H5„, - Dj) - fDj + Sl N-'Sg 

= {5^ ~ Dj^D-^S^ - Dj) + SliRSR"" + Ny^Sg, (B2) 

where we have used the definitions of D and j, eqs ( I50t and l l51b . 
in the first step, then completed the square in the second step, and 
we will separately prove the last step as Lemma 1 in the next sub- 
section. After doing that we will prove that 

|27rAfl|27rS'| = \2ivD\\2Tr{RSR'' + N)\, (B3) 

which we name Lemma 2, allowing us to reformulate eq. l IBU as 

1 



y/\2-KD\\2n{RSR-i + N)\ 

X exp (-^(<5m - DjyD-\S„, - Dj) 

X exp (-i ^t {RSR'' + N)-^5g^ , 
which is what we claimed in eq. J49b . 



(B4) 



Bl Lemma 1 

In this subsection we prove that 

j'^Dj - sl N-^Sg = ^5l (RSR'' + N)-^Sg. (B5) 

In order to simplify the notation, let us introduce 
M = R'^N-^R. (B6) 

It can be easily seen that eq. jBSl is equivalent to 

N-^R{S~'^+M)~'^R'^ N'^-N'^ = -{RSR^ +N)-^ (B7) 

by inserting the respective expressions for D and j. We start with 
eq. ( IB7t and transform it into an equation which is true. 

N-'^R{S~'^ + M)-^R^ N'^ ~N'^ = ~{RSR^ + iV)"^ 
^^ RSR^ N'^ R{S~^ + M)-^ R' N'^ 

+R(S'^ + M)-^R'N-^ - RSR'N-^ - 1 = -1 
^^ RS[M(S'^ + M)'^ + (1 + MS)'^ - 1]R^N'^ = 
^^ RS[MS{1 + MS)'^ + (1 + MS)'^ - l]R^N-^ = 
^^ RS[{1 + MS){1 + MS)'^ - l]R'N-^ = 0. (B8) 

This equation is true, QED 



APPENDIX B: DERIVATION OF THE JOINT 
PROBABILITY DISTRIBUTION 

In this section we will derive in detail the expression for the joint 
probability distribution P{5g, 5m \p) given in eq. ( |49t . We start 



B2 Lemma 2 

In the following we prove that 

|27riV||27r5'| = \2tvD\\2tv{RSR'' + N)\, 



(B9) 
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which is equivalent to 

\N\\S\ = \D\\RSR' +N\, 



(BIO) 



for the factors of 2-n cancel for matrices that operate on the same 
vector space. Let us write 

\N\\S\ 



\D\ 



= l^lis||^-'| 



N\\SD-^\ 

N\\S{S-^ +R''N-^R)\ 
iV|exp(log|l + S'i?tiV"'i?|) 
iV| exp (Tr log(l + SR''"N~'R)) 
iV| exp (Tr log(l + RSR'''N"^)) 
iV|exp(log|l+i?S'-R''"Af~'|) 
NWRSR^'N-^ + 1\ 
(RSR'^N-^ + 1)N\ 
RSR'^ +N\. 



(BU) 



The crucial step here was to use the cyclic invariance of the trace Tr 
and to notice that this cyclic invariance still holds for the trace of a 
logarithm, which can be easily verified using the Taylor expansion 
of the logarithm. 
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